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Abstract. Given a positive integer n and a positive semidefinite matrix 
A = (Aij) £ R mxm , the positive semidefinite Grothendieck problem with 
rank-n-constraint (SDP n ) is 



maximize Aij Xi ■ Xj, where x\, . . . , x m £ S n 1 . 

»=1 2=1 

In this paper we design a randomized polynomial-time approximation 
algorithm for SDP„ achieving an approximation ratio of 

2 / r((n+l)/2) \ 2 _ 1 ( 



^=n{ r(n/2) = 1 -^M 

We show that under the assumption of the unique games conjecture the 
achieved approximation ratio is optimal: There is no polynomial-time 
algorithm which approximates SDP n with a ratio greater than 7(n). 
We improve the approximation ratio of the best known polynomial-time 
algorithm for SDPi from 2/tt to 2/ (ny(m)) = 2/tv + 0(l/m), and we 
show a tighter approximation ratio for SDP„ when A is the Laplacian 
matrix of a graph with nonnegative edge weights. 
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1 Introduction 



Given a positive integer n and a positive semidefinite matrix A = (Aij) G R" 1 * 7 ™, 
the positive semidefinite Grothendieck problem with rank-n- constraint is defined 
as 

{mm "\ 
Xi ■ Xj : xi, . . . ,x m G S*™ -1 L 
4=1 j = l ' 

where 5 n_1 = {a; G R n : x ■ x — 1} is the unit sphere. Note that the inner 
product matrix of the vectors x±,... ,x m has rank n. This problem was intro- 
duced by Brief, Buhrman, and Toner [5] in the context of quantum nonlocality 
where they applied it to nonlocal XOR games. The case n = 1 is the classical 
positive semidefinite Grothendieck problem where x\, . . . , x m G { — 1, +1}. It was 
introduced by Grothendieck [7] in the study of norms of tensor products of Ba- 
nach spaces. It is an NP-hard problem: If A is the Laplacian matrix of a graph 
then SDPi(A) coincides with the value of a maximum cut of the graph. The 
maximum cut problem (MAX CUT) is one of Karp's 21 NP-complete problems. 
Over the last years, there has been a lot of work on algorithmic applications, 
interpretations, and generalizations of the Grothendieck problem and the com- 
panion Grothendieck inequalities. For instance, Nesterov [18] showed that it 
has applications to finding and analyzing semidefinite relaxations of nonconvex 
quadratic optimization problems. Ben-Tal and Nemirovski [3] showed that it 
has applications to quadratic Lyapunov stability synthesis in system and control 
theory. Alon and Naor [3] showed that it has applications to constructing Sze- 
meredi partitions of graphs and to estimating the cut norm of matrices. Linial 
and Shraibman |15) showed that it has applications to finding lower bounds in 
communication complexity. Khot and Naor [T^], [T3] showed that it has appli- 
cations to kernel clustering. For other applications, see also Alon, Makarychev, 
Makarychev, and Naor [2], and Raghavendra and Steurer [2"U] . 

One can reformulate the positive semidefinite Grothendieck problem with 
rank-n-constraint as a semidefinite program with an additional rank constraint: 

m m 

maximize ^ ^AjjXjj 

»=1 3=1 

subject to X = (Xij) G R mxm is positive semidefinite, 
Xu = 1, for i = 1, . . . , m, 
X has rank at most n. 

When n is a constant that does not depend on the matrix size m there is no 
polynomial-time algorithm known which solves SDP„. It is also not known if 
the problem SDP„ is NP-hard when n > 2. On the other hand the semidefinite 
relaxation of SDP„(A) defined by 

s m m n 

SDP 00 (A) = maxj ^ ^ A^ m ■ Uj : u 1( . . . , u m G S°° > 
^i=i j=i > 



can be computed in polynomial time to any desired precision by using, e.g., the 
ellipsoid method. Here S°° denotes the unit sphere of the Hilbert space Z 2 (R) 
of square summable sequences, which contains R™ as the subspace of the first 
n components. Clearly, it would suffice to use unit vectors in R m for solving 
SDP 00 (A) when A £ R""*" 1 , but using S°° will simplify many formulations 
in this paper. Rietz |21j (in the context of the Grothendieck inequality) and 
Nesterov |18) (in the context of approximation algorithms for NP-hard problems) 
showed that SDPi and SDPqo are always within a factor of at most 2/tt from 
each other. That is, for all positive semidefinite matrices A £ R mxm we have 

SDPi (A) 2 
f > ^— £- > -. (1 

By exhibiting an explicit series of positive semidefinite matrices, Grothen- 
dieck [7] (see also Alon and Naor [3J Section 5.2]) showed that one cannot im- 
prove the constant 2/tt to 2/tt + e for any positive e which is independent of to. 
Nesterov jf 8] gave a randomized polynomial-time approximation algorithm for 
SDPi with approximation ratio 2/7T which can be derandomized using the tech- 
niques presented by Mahajan and Ramesh [TB]. This algorithm is optimal in the 
following sense: Khot and Naor [12] showed that under the assumption of the 
unique games conjecture (UGC) there is no polynomial-time algorithm which 
approximates SDPi to within a ratio of 2/7T + e for any positive e independent 
of to. The unique games conjecture was introduced by Khot |10j and by now 
many tight UGC hardness results are known, see e.g. Khot, Kindler, Mossel, 
and O'Donnell [IT] for the maximum cut problem, Khot and Regev [14] for the 
minimum vertex cover problem, and Raghavendra |19j for general constrained 
satisfaction problems. The aim of this paper is to provide a corresponding anal- 
ysis for SDP„. 



Our results 

In Section [2] we start by reviewing our methodological contributions: Our main 
contribution is the analysis of a rounding scheme which can deal with rank-n- 
constraints in semidefinite programs. For this we use the Wishart distribution 
from multivariate statistics (see e.g. Muirhead [T7]). We believe this analysis is of 
independent interest and will turn out to be useful in different contexts, e.g. for 
approximating low dimensional geometric embeddings. Our second contribution 
is that we improve the constant in inequality (JTJ slightly by considering functions 
of positive type for the unit sphere S 1 " 1-1 and applying a characterization of 
Schoenberg [22]. This slight improvement is the key for our UGC hardness result 
of approximating SDP„ given in Theorem [3] We analyze our rounding scheme 
in Section [3] 

Theorem 1. For all positive semidefinite matrices A £ R mxm we have 



SDP„M) 

1 > > 

~ SDPoo(A) " 



and there is a randomized polynomial-time approximation algorithm for SDP„ 
achieving this ratio. 

The first three values of 7(n) are: 

7(1) = 2/tt = 0.63661... 
7(2) =tt/4 = 0.78539... 
7 (3) = 8/(3tt) = 0.84882... 

In Section U we show that one can improve inequality ([1]) slightly: 

Theorem 2. For all positive semidefinite matrices A G ^ mxm we have 

SDPiQA) 2 _mf T(m/2) \ 2 _2 fl\ 

~ SDPoolA) " 7r 7 (m) 7T ^T((m+ l)/2)/ tt + W ' 

and there is a polynomial-time approximation algorithm for SDPi achieving this 
ratio. 

With this, the current complexity status of the problem SDPi is similar to the 
one of the minimum vertex cover problem. Karakostas [3] showed that one can 
approximate the minimum vertex cover problem for a graph having vertex set 
V with an approximation ratio of 2 — 0(l/ylog \V\) in polynomial time. On the 
other hand, Khot and Regev [Tl] showed, assuming the unique games conjecture, 
that there is no polynomial-time algorithm which approximates the minimum 
vertex cover problem with an approximation factor of 2 — e for any positive e 
which is independent of \V\. In Section [S] we show that the approximation ratio 
7(71) given in Theorem[T]is optimal for SDP„ under the assumption of the unique 
games conjecture. By using the arguments of the proof of Theorem [5] and by the 
UGC hardness of approximating SDPi due to Khot and Naor [T2] we get the 
following tight UGC hardness result for approximating SDP„. 

Theorem 3. Under the assumption of the unique games conjecture there is no 
polynomial-time algorithm which approximates SDP„ with an approximation ra- 
tio greater than "f(n) + e for any positive e which is independent of the matrix 
size m. 

In Section[B]we show that a better approximation ratio can be achieved when 
the matrix A is the Laplacian matrix of a graph with nonnegative edge weights. 



2 Rounding schemes and functions of positive type 

In this section we discuss our rounding scheme which rounds an optimal solution 
of SDPoo to a feasible solution of SDP„. In the case n — 1 our rounding scheme 
is equivalent to the classical scheme of Goemans and Williamson [5] . To analyze 
the rounding scheme we use functions of positive type for unit spheres. The 
randomized polynomial-time approximation algorithm which we use in the proofs 
of the theorems is the following three-step process. The last two steps are our 
rounding scheme. 



1. Solve SDP oc (A), obtaining vectors u\, . . . , u m £ S™ 1 ^ 1 . 

2. Choose X = (Xij) £ l" xm so that every matrix entry Xij is distributed 
independently according to the standard normal distribution with mean 
and variance 1, that is, X^ ~ iV(0, 1). 

3. Set x t = Xui/\\Xui\\ £ S 71 - 1 with i = l,...,m. 

The quality of the feasible solution x±, . . . , x m for SDP n is measured by the 
expectation 



E 



EE a 

i=l j=l 



m m 



1 = 1 j=l 



Xuj 

TxaZ. 



which we analyze in more detail. 
For vectors u, v £ S°° we define 

E n (u,v)=E 



Xu 
JXu\\ 



Xv 



Xv 



Xu-; 



\Xuj 



(2) 



where X = (Xy) is a matrix with n rows and infinitely many columns whose 
entries are distributed independently according to the the standard normal dis- 
tribution. Of course, if it, v £ S™" 1 , then it suffices to work with finite matrices 
X £ W lXm . 

The first important property of the expectation E n is that it is invari- 
ant under O(oo), i.e. for every m it is invariant under the orthogonal group 
O(m) = {T £ M raxm : T T T = I m }, where I m denotes the identity matrix. More 
specifically, for every m and every pair of vectors u,v £ g" 1-1 we have 



E n (Tu, Tv) = E n (u, v) for all T £ O(m). 



If n = 1, then 

Ei(u, v) = E[sign(^ • u) sign(^ • v)}, 

where f £ R m is chosen at random from the m-dimensional standard normal 
distribution. By Grothendieck's identity (see e.g. jH Lemma 10.2]) 

2 

E[sign(£ • u) sign(^ • v)] = — arcsinu • v. 

TT 

Hence, the expectation E\ only depends on the inner product t = u ■ v. For 
general n, the O(oo) invariance implies that this is true also for E n . 

The second important property of the expectation E n (now interpreted as 
a function of the inner product) is that it is a function of positive type for 
S* 00 , i.e. it is of positive type for any unit sphere S"™" 1 , independent of the 
dimension m. In general, a continuous function / : [— 1, 1] — > R is called a 
function of positive type for S m ~ l if the matrix (f(vi ■ Vj))i<ij<N is positive 
semidefinite for every positive integer N and every choice of vectors v±, . . . , vn £ 
gm-i rpj^ ex p ec tation E n is of positive type for S°° because one can write it 



as a sum of squares. Schoenberg 22 characterized the continuous functions 
/ : [—1,1] — > R which are of positive type for S°°: They are of the form 



/(*)=£/***» 



i=0 



with nonnegative /j and X}i=o /•> ^ 00 • ^ n ^ ne case B = 1 we have the series 
expansion 



, . 2 2 ^ (2t)! 

^(f) = -arcsini = > 

7T TT ^ ' 



f2i+l 



7T 7T^ 2 2i (i!) 2 (2i+ 1) 

In Section [3] we treat the cases n > 2. 

Suppose we develop the expectation £7 n (t) into the series E n (t) = ^2°1 fit 1 . 
Then because of Schoenberg's characterization the function t \-> E n (t) — fit is of 
positive type for S* 00 as well. This together with the inequality J2i j ^ij^ij — Oj 
which holds for all positive semidehnite matrices 1,7 6 R mxm , implies 

m rn m rn 

SDP„(A) > A » E n(ui, Uj) > fx £ u * • M J = /i SDP oo(^)- (3) 

i—1 j — 1 i=l j=X 

When n = 1 the series expansion of E\ gives j\ — 2/t: and the above argument 
is essentially the one of Nesterov [18] . To improve on this (and in this way to 
improve the constant 2/tt in inequality ([I])) one can refine the analysis by working 
with functions of positive type which depend on the dimension m. In Section 0] 
we show that t n> 2/7r(arcsint — t/ r y(m)) is a function of positive type for S™ 1 ^ 1 . 
For the cases n > 2 we show in Section [3] that fi = ~f(n). 



3 Analysis of the approximation algorithm 



In this section we show that the expectation E n defined in ([2]) is a function of 
positive type for S°° and that in the series expansion E n {t) — 5Z°f. one 
has /i = "f(n). These two facts combined with the discussion in Section [5] imply 
Theorem [TJ Let u, v G be unit vectors and let X = (Xy) G R™ xm be 

a random matrix whose entries are independently sampled from the standard 
normal distribution. Because of the invariance under the orthogonal group, for 
computing E n (u, v) we may assume that u and v are of the form 

u = (cos 0, sin 0, 0, ... , 0) T 
v = (cos 0,- sin 0,0, ...,0) T . 

Then by the double- angle formula cos 20 = t with t = u-v. 
We have 



Xu 



( -X"ll X\2 



\X n l Xra2y 



lsin0 



, Xv 



( X\\ ^12 



\X n l X n 2j 



costf 
— sin ( 



Hence, 

Xu Xv _ x T Yy 
\\Xu\\ ' JXvJ\ - ^/( x TYx)(y T YW 

where x — (cos 9, sin 9) T , y — (cos 9, — sin 9) T , and Y e M 2x2 is the Gram matrix 
of the two vectors (Xu, . . . , X n i) T , (X12, . . . , X n 2) T G M™. By definition, Y is 
distributed according to the Wishart distribution from multivariate statistics. 
This distribution is defined as follows (see e.g. Muirhead [32]). Let p and q be 
positive integers so that p > q. The (standard) Wishart distribution W q (p) is the 
probability distribution of random matrices Y = X T X 6 M 9X,J , where the entries 
of the matrix X — pQj) S R px<? are independently chosen from the standard 
normal distribution ~ N(0, 1). The density function of Y ~ W g (p) is 

/9 L e -^(y)/2 (dety)(p - g -i)/2 

2Pi/ 2 r q (p/2) y ' 

where 7^ is the multivariate gamma function, defined as 

r q (x) = n^-^f[r(x- 1 ^ 

We denote the cone of positive semidefinite matrices of size q x q by iS> ' In 
our case p = n and q = 2. We can write E n (t) as 

£„(*) = - ^ /ON / ; ^ = e - Tr ( y )/ 2 (detr)("- 3 )/ 2 dy, 

reV ; 2«r 2 (n/2) 7 s|o Vt^^d/^!/) V ' 

where t = cos 29, and cc as well as y depend on 9. The parameterization of the 
cone S> given by 

rt2 f^A /f + acos0 asin</> \ rn . r . , 

Slo = { 1- = ( 2 Q sin / a _ a c ; s J : G [0, M , a e [0, a/2] , a G R>„ 

allows us to write the integral in a more explicit form. With this parametrization 
we have 

a 2 

Tr(y) = a, det(F) = — — a 2 , dY = a dcf>dada, 



and 



t„ erf , 
x Yy = — + acos0, 

x T yx = — + a(t cos + 2 sin 9 cos sin 1 

y T Yy = — + a(t cos cfr — 2 sin 6* cos sin <; 



So, 
£»(*) 



a/2 />27r at 



^ + a COS ( 



2 n r 2 (n/2)J J Q J ^(1 + at cos 0) 2 - a 2 (l - t 2 )(sin0) 2 

/ 2 \ (n-3)/2 

-a/2 ( H__ a 2\ a d(t>dada ^ 



Substituting a = (a/2)r and integrating over a yields 

2"" 1 r 2 (n/2) Jo J ^/(l + rtcos^-r^l-^Xsin^) 2 

Using Legendre's duplication formula (see [TJ Theorem 1.5.1]) r(2x)r(l/2) 
2 2x - 1 r{x)r{x + 1/2) one can simplify 

r(n) n — 1 



2™- 1 r 2 (n/2) 2tt ' 

Recall from ([3]) that the approximation ratio is given by the coefficient fx in the 
series expansion E n (t) = J2iLo Now we compute f\\ 

n-1 Z" 1 Z" 2 " r(l - r 2 )^" 1 )/ 2 



2tt 7 7o (1 -r 2 (sin0) 2 ) 3 / : 



Using Euler's integral representation of the hypergeometric function [TJ Theorem 
2.2.1] and by substitution we get 

n - 1 [ 2 * r(l)r((n+l)/2) / 3/2, 1 2 



2^ 7 2P((n + 3)/2) 4 L \{n + ?,)/2' 

Vx( ( ^%%;t)(i-t)-^*. 



4^ T((n + 3)/2) 7 
»-ir((n + l)/2)l 



TT T((n + 3)/2)2y 

This simplies futher by Euler's generalized integral [TJ (2.2.2)], and Gauss's sum- 
mation formula [TJ Theorem 2.2.2] 

n -ir(( n + l)/2)f(l/2)r(l/2) / 3/2, 1, 1/2 
Jl 2^ T((n + 3)/2) r(l) 3 2 V(n + 3)/2,l' 
_ n-lf((n + l)/2) / 3/2,1/2 \ 
2 T((n + 3)/2) 2 ^(n + 3)/2' ^ 
_ n - 1 T((n + l)/2) r((n + 3)/2)r((n - l)/2) 
2 T((n + 3)/2) r(n/2)r((n+2)/2) 

_ 2 /T((n + 1)/2) N " 



n \ r{n/2) 



4 Improved analysis 

Nesterov's proof of inequality (JTJ relies on the fact that the function t h-> 
2/7r(arcsini — i) is of positive type for S 100 . Now we determine the largest 



value c(m) so that the function t i— ¥ 2/7r(arcsini — c{m)t) is of positive type 
for S 1 '™ -1 . By this we improve the approximation ratio of the algorithm given 
in Section [5] for SDPi from 2/ir to (2/tt)c(iti). The following lemma showing 
c(m) = 1/7(771) implies Theorem [5] 

Lemma 1. The function 

2 / t 
1 1->- — arcsin i 



7T \ 7( TO ), 
is of positive type for S m . 

Proof. We equip the space of all continuous functions / : [—1,1] — > R with the 
inner product 

(f,9)a = J J(t)g(t)(l-t 2 ) a dt, 

where a = (m — 3)/2. With this inner product the Jacobi polynomials satisfy 
the orthogonality relation 

where P^ Q,Q ' is the Jacobi polynomial of degree i with parameters (a, a), see e.g. 
Andrews, Askey, and Roy Q]. Schoenberg [35] showed that a continuous function 
/ : [—1,1] — » R is of positive type for g" 1 " 1 if and only if it is of the form 

00 

i=0 

with nonnegative coefhcients fi such that fi < °°- 

Now we interpret arcsin as a function of positive type for g" 1-1 where m 
is fixed. By the orthogonality relation and because of Schoenberg's result the 
function arcsin t — c{m)t is of positive type for S 1 " 1-1 if and only if 

(arcsin t - c(m)t, P^ a) ) a > 0, for all £ = 0, 1, 2, . . . . 

We have p[ a,a \t) = (a + l)t. By the orthogonality relation and because the 
arcsin function is of positive type we get, for i 1, 

(arcsin t — c(m)t, P[ a ) a = (arcsin t, P- a '°^) a > 0. 

This implies that the maximum c(m) such that arcsin?; — c(m)t is of positive 
type for S™ -1 is given by c(m) = (axcsint, t) a /(t, t) a . 
The numerator of c(m) equals 



(arcsin t,t) a = J arcsin(rj)r;(l — t 2 ) a dt 



tt/2 

0sin0(cos6O 2Q+1 d6) 

-7T/2 

r(l/2)r(a + 3/2) 
(2a + 2)r(a + 2) ' 



The denominator of c(m) equals 

where we used the beta integral (see e.g. Andrews, Askey, and Roy 1, (1.1.21)]) 

rl t 2 *-\l-t 2 y- 1 dr= r \ S in6) 2x - 1 (cos6) 2y - 1 d6 = F{x ) r ^ 
Jo 2r(x + y) 

Now, by using the functional equation xT(x) = r(x + 1), the desired equality 
c(m) = 1/7(771) follows. □ 



5 Hardness of approximation 

Proof (of Theorem^. Suppose that p is the largest approximation ratio a poly- 
nomial-time algorithm can achieve for SDP„. Let Hi,... ,u m € S n be an ap- 
proximate solution to SDP n (A) coming from such a polynomial-time algorithm. 
Then, 

rn m 

Ui ■ uj > pSBP n {A). 

i=i j=i 

Applying the rounding scheme to u\, . . . ,u m 6 .S™ -1 gives xi,...,x m 6 { — 1, +1} 
with 



E 



m m 



m m 



Aij x,[Xj = — Aij arcsin Ui ■ Uj 

%=i j=i J i=i j=i 



> _^-SDP n (A), 
irj(n) 

where we used that the matrix A and the matrix 

Ui ■ Uj 



{I ( arCSi 



arcsin Ui ■ Uj 



l<i,j<m 



are both positive semidefmite. The last statement follows from Lemma [T] applied 
to the vectors ui, . . . ,u m lying in S" l_1 . Since SDP„(A) > SDPi(A), this is a 
polynomial-time approximation algorithm for SDPi with approximation ratio 
at least (2p)/(TTj(n)). The UGC hardness result of Khot and Naor now implies 
that p < 7(n). □ 



6 The case of Laplacian matrices 

In this section we show that one can improve the approximation ratio of the al- 
gorithm if the positive semidefmite matrix A = (Aij) S K mxrn has the following 



special structure: 

Ay<0, iii^j, 

n 

Ay = 0, for every j = 1, . . . , n. 

i=l 

This happens for instance when A is the Laplacian matrix of a weighted graph 
with nonnegative edge weights. A by now standard argument due to Goemans 
and Williamson [6 shows that the algorithm has the approximation ratio 

. \ \-E n (t) 
v(n) = mm < — — : t G [— 1, 1 

To see this, we write out the expected value of the approximation and use 
the properties of A: 



n n 

' ^ ] AijXi ■ x^J = ^ ^ AijE n (ui ■ Uj) 



E, , / 1 - E k (ui ■ uj) \ 



n 

> u(n) A. t jUi ■ Uj 
= «(n) SDP^CA). 

The case n = 1 corresponds to the MAX CUT approximation algorithm of 
Goemans and Williamson [6]. For this we have 

v(l) = 0.8785 . . . , minimum attained at to — —0.689 .... 

We computed the values v(2) and w(3) numerically and got 

v(2) = 0.9349 . . . , minimum attained at to = —0.617 . . . , 
v{3) = 0.9563 . . . , minimum attained at to = —0.584 .... 
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